Automatic Validation of Terminology by Means of Formal Concept Analysis

نویسندگان

  • Luis Felipe Melo Mora
  • Yannick Toussaint
چکیده

Term extraction tools extract candidate terms and annotate their occurrences in the texts. However, not all these occurrences are terminological and, at present, this is still a very challenging issue to distinguish when a candidate term is really used with a terminological meaning. The validation of term annotations is presented as a bi-classification model that classifies each term occurrence as a terminological or non-terminological occurrence. A context-based hypothesis approach is applied to a training corpus: we assume that the words in the sentence which contains the studied occurrence can be used to build positive and negative hypotheses that are further used to classify undetermined examples. The method is applied and evaluated on a french corpus in the linguistic domain and we also mention some improvements suggested by a quantitative and qualitative evaluation.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

طراحی سامانه نیمه‌خودکار ساخت هستی‌شناسی به‌کمک تحلیل هم‌رخدادی واژگان و روش C-value (مطالعه موردی: حوزه علم‌سنجی ایران)

Ontology is one of formal concepts and the relations in the specific regions.It have recently tried to design the learning, automatic methods of Ontology. Whereas Ontology containing concepts and the relations, exploiting concepts, the semantic relations among concept. The various Ontology of regions and different applications are expensive processes that are automatic.The lack of main knowledg...

متن کامل

Browsing Search Results via Formal Concept Analysis: Automatic Selection of Attributes

This paper presents the JBraindead Information Retrieval System, which combines a free-text search engine with online Formal Concept Analysis to organize the results of a query. Unlike most applications of Conceptual Clustering to Information Retrieval, JBraindead is not restricted to specific domains, and does not use manually assigned descriptors for documents nor domain specific thesauruses....

متن کامل

A Semi-automatic Method to Ontology Design by Using FCA

Ontology design is a complex and time-consuming process. It is extremely difficult for human experts to discover ontology from given data or texts. This paper presents a semi-automatic method for ontology extraction and design. The method is based on Formal Concept Analysis and a Horn clause model of a concept lattice. Inputs to the technique are domain-specific texts or data. After transformat...

متن کامل

فرایند تعلیم و تربیت با اقتباس از مبانی عرفان اسلامی

 This study intends to explain the process of education, based on the educational foundations of Islamic mysticism. Indeed, this investigation tries to answer the question "what is the process of education based on Islamic mysticism?" In this study the analytic - deductive method is used to respond to the above question. At first, the ontological, epistemological and humanistic foundations of I...

متن کامل

Semi-automatic Construction of Ontology-based CBR System for Knowledge Integration

In order to integrate knowledge in heterogeneous case-based reasoning (CBR) systems, ontology-based CBR system has become a hot topic. To solve the facing problems of ontology-based CBR system, for example, its architecture is nonstandard, reusing knowledge in legacy CBR is deficient, ontology construction is difficult, etc, we propose a novel approach for semi-automatically construct ontology-...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015